Conformance Tests for SEP-2322 MRTR by CaitieM20 · Pull Request #188 · modelcontextprotocol/conformance

CaitieM20 · 2026-03-17T23:17:18Z

Draft conformance tests for SEP-2322: Multi Round-Trip Requests.

Also added code to client-helper.ts to make raw MCP requests (i.e. basic JSON requests). This will be generally useful for draft features that may not have reference implementations yet.
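For readers unfamiliar with the idea, a raw MCP request is just a JSON-RPC 2.0 message sent without an SDK client in between. The following is a minimal sketch of that idea, not the code from client-helper.ts: `buildRawRequest`, `sendRawRequest`, and the `PostJson` transport type are hypothetical names, and the transport is injected so the sketch makes no assumption about how the harness talks to the server.

```typescript
// Hypothetical sketch: these names are illustrative and are not taken from client-helper.ts.
interface JsonRpcRequest {
  jsonrpc: "2.0";
  id: number;
  method: string;
  params?: Record<string, unknown>;
}

// Assumed transport: posts a JSON body to a URL and resolves with the response text.
type PostJson = (url: string, body: string) => Promise<string>;

let nextId = 1;

// Build a plain JSON-RPC 2.0 request object without going through an SDK client.
function buildRawRequest(
  method: string,
  params?: Record<string, unknown>
): JsonRpcRequest {
  const request: JsonRpcRequest = { jsonrpc: "2.0", id: nextId++, method };
  if (params !== undefined) {
    request.params = params;
  }
  return request;
}

// Serialize the request, hand it to the injected transport, and parse the reply.
// Assumes the server answers with a single JSON document.
async function sendRawRequest(
  url: string,
  request: JsonRpcRequest,
  post: PostJson
): Promise<unknown> {
  const text = await post(url, JSON.stringify(request));
  return JSON.parse(text);
}
```

A test for a draft feature could then call `buildRawRequest("tools/call", { name: "example_tool", arguments: {} })` and inspect the raw response fields directly, which is the property that matters when no SDK types exist for the feature yet.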

Motivation and Context

See SEP

How Has This Been Tested?

Conformance tests and the reference implementation are both work in progress.

Breaking Changes

Yes, see SEP.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation update

Checklist

I have read the MCP Documentation
My code follows the repository's style guidelines
New and existing tests pass locally - existing tests pass; draft tests do not, since we don't have an implementation yet
I have added appropriate error handling
I have added or updated documentation as needed

Additional context

pkg-pr-new · 2026-03-17T23:17:47Z

npx https://pkg.pr.new/@modelcontextprotocol/conformance@188

commit: 85fe6b7

panyam · 2026-05-06T00:49:35Z

Hello - saw this PR while looking at the 2322 finalizing threads. I've been porting our local MRTR + Tasks Extension scenarios into a fork of the official suite at panyam/mcpconformance:feat/tasks-mrtr-extension - looks like our ephemeral-flow scenarios cover similar ground to your A1-A7 set, and we've also built out the wider Tasks Extension perimeter (lifecycle, capability negotiation, dispatch, request-state, headers, notifications) which this PR doesn't span.

The bridge scenario (Tasks + MRTR partial fulfillment) is narrow on our side - 3 checks - vs your incomplete-result-tasks.ts which goes deeper. Looks like the two are mostly complementary.

If you're planning to revive this PR after the SEP finalizes, happy to help refresh wire format and pair on the bridge surface. Otherwise I can open a separate PR for the wider Tasks Extension scope and defer the ephemeral-flow / bridge depth to whatever lands here. Or some merged form, whatever's easiest for you. Just wanted to make sure I wasn't undoing anything 🙏

CaitieM20 · 2026-05-21T02:49:06Z

Hey @panyam would love some help here. I've updated this PR now that 2322 is Approved and checked in. I removed the tasks tests and marked the checks as excluded since we have moved Tasks to an extension.

I saw you opened one for the Task extension which I think is the right path forward for the Task Conformance tests. Appreciate the help.

panyam · 2026-05-21T04:01:21Z

@CaitieM20 Thanks - and yep, totally agree with the split. PR 262 (the Task extension side) is now in a position to be merged. Luca has signed off and asked pcarleton to give it a final look. We're shipping it with two cross-SEP suites deliberately skipped (mrtr-tasks-composition and tasks-status-notifications via subscriptions/listen). Plans are there for a fast-follow for both. These both overlap naturally with the MRTR side here - the composition test in particular needs to encode the asymmetric requestState invariant (MRTR phase carries requestState, Task phase forbids it), which only lands cleanly once both PRs are in.

So when this one merges I can bring on the composition harness against whatever fixture shape you settle on. Will review the refreshed diff here in the meantime and surface anything from the Task-side experience that's worth folding in - but overall thanks for all the updates. Looking great and can't wait for it!
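To make the asymmetric requestState invariant mentioned above concrete, here is a rough sketch of how a composition check might encode it. Everything here is an assumption for illustration: the `PhaseMessage` shape, the `Phase` labels, and the function name are hypothetical, and "carries" is read as "requestState is present", which the final composition test may define differently.

```typescript
// Hypothetical shapes: neither PR defines these types.
type Phase = "mrtr" | "task";

interface PhaseMessage {
  phase: Phase;
  requestState?: string;
}

// Returns a description of the violation, or null when the message satisfies
// the asymmetric rule: MRTR phase carries requestState, Task phase forbids it.
function findRequestStateViolation(message: PhaseMessage): string | null {
  const hasState = message.requestState !== undefined;
  if (message.phase === "mrtr" && !hasState) {
    return "MRTR phase message is missing requestState";
  }
  if (message.phase === "task" && hasState) {
    return "Task phase message must not include requestState";
  }
  return null;
}
```

The point of keeping both branches in one function is that the same field is required in one phase and rejected in the other, so a harness covering only one side would miss half of the invariant.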

…quirement

The traceability schema recognizes only check/text/url/issue/excluded on a requirement row. The 'note:' field on the scenario-gate rows was silently dropped, so those 11 rows would have been ingested as ordinary requirement rows whose text is not a spec sentence, inflating the SEP-2322 requirement count on the traceability dashboard.

- Remove the 10 flow-gate rows (sep-2322-*-complete, sep-2322-multi-round-r*, sep-2322-non-tool-*) plus sep-2322-multiple-inputs-incomplete. The checks are unchanged and still emitted by the scenarios; their IDs now surface in the manifest's 'untracked' list, which is the designed home for scenario scaffolding that doesn't map to an RFC-2119 sentence.
- Move 'inputRequests keys ... MUST be unique' to the excluded list: duplicate JSON object keys are collapsed by the parser before the harness can observe them, so the requirement is not testable at the protocol level. The check previously paired with it actually verifies that the server returns three inputRequests of different method types, which is a flow gate, not a key-uniqueness test.
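The key-uniqueness point can be seen with a few lines of TypeScript. This is only an illustration: the payload is a made-up fragment, not a fixture from the suite, and the key name "a" is arbitrary. With the standard `JSON.parse`, a repeated object key is collapsed to its last value, so code that runs after parsing never sees the duplicate.

```typescript
// Illustrative wire payload with the key "a" repeated inside inputRequests.
// The payload shape is an assumption made for this example only.
const wire =
  '{"inputRequests": {' +
  '"a": {"method": "elicitation/create"}, ' +
  '"a": {"method": "sampling/createMessage"}' +
  "}}";

const parsed = JSON.parse(wire) as {
  inputRequests: Record<string, { method: string }>;
};

// Only one key survives parsing, and it holds the last value seen.
const keys = Object.keys(parsed.inputRequests);
console.log(keys.length); // → 1
console.log(parsed.inputRequests["a"]?.method); // → "sampling/createMessage"
```

Observing the duplicate would require inspecting the raw response text before parsing, which is outside what a check operating on parsed messages can do.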

The spec says the client MUST echo back the exact value of requestState and MUST NOT inspect, parse, or modify it. The check previously parsed the returned state as JSON and compared two fields, so a client that deserialized the state and re-serialized it (different key order, whitespace, extra fields) would still pass despite having modified the opaque value. Store the exact string the mock server sent and compare the echoed value with strict string equality instead. Also include the sent value in the check details so a mismatch is diagnosable from the report.
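The difference between the two comparison strategies can be sketched as follows. The field names `step` and `token` and both function names are hypothetical, since the commit message does not say which two fields the old check compared; only the contrast between field-level and exact-string comparison is taken from the description above.

```typescript
// The exact string the mock server is assumed to have sent as requestState.
const sentState = '{"step":1,"token":"abc"}';

// Old-style check: parse both values and compare two fields.
function lenientEchoCheck(sent: string, echoed: string): boolean {
  try {
    const a = JSON.parse(sent) as Record<string, unknown>;
    const b = JSON.parse(echoed) as Record<string, unknown>;
    return a.step === b.step && a.token === b.token;
  } catch {
    return false;
  }
}

// New-style check: the echoed value must be the identical string.
function strictEchoCheck(sent: string, echoed: string): boolean {
  return sent === echoed;
}

// A client that deserialized and re-serialized the state, changing key order
// and adding a field, has modified the opaque value.
const reserialized = JSON.stringify({ token: "abc", step: 1, extra: true });

console.log(lenientEchoCheck(sentState, reserialized)); // → true (modification goes unnoticed)
console.log(strictEchoCheck(sentState, reserialized)); // → false (modification is caught)
console.log(strictEchoCheck(sentState, sentState)); // → true
```

Strict equality also keeps the check independent of the state's encoding, which matters because the spec treats requestState as opaque and it need not be JSON at all.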

pcarleton

LGTM!

Left 2 small tweaks in follow-up commits. I'll optimistically merge, but lmk if you disagree with either.

* chore: refresh SEP traceability manifest (typescript-sdk@main)

Regenerated from a client+server suite run against typescript-sdk@5fc42e9be115 following the recipe in .github/workflows/traceability.yml.

New entries since the last refresh (typescript-sdk@22595b96):

- SEP-2322 (MRTR, #188): 17 tested, 0 untested, 16 excluded, 3 untracked
- SEP-2549 (TTL for list results, #275): 7 tested, 0 untested, 13 excluded
- SEP-2260: 12 excluded rows, no checks
- SEP-2207: yaml rows added since the last refresh now appear (1 tested, 1 untested: sep-2207-server-no-offline-access)

No previously-tested requirement regressed.

* Exclude sep-2207 server offline_access guidance until RS auth scenarios exist

sep-2207-server-no-offline-access was declared in the yaml but no scenario emits it, so it surfaced as the only untested requirement in the refreshed manifest. The check needs to probe the SDK server's Protected Resource Metadata scopes_supported and WWW-Authenticate challenge scope, and the server suite does not yet exercise the SDK server as an OAuth protected resource at all. Mark the requirement excluded with a pointer to #116 (server-side authorization baseline) rather than leaving it as a permanently-untested row; revisit when server-side authorization scenarios land.

CaitieM20 added 11 commits March 13, 2026 11:28

initial agent suggested changes + refactoring

6e81e86

initial agent suggested changes + refactoring

c481e98

add ListRoot Tests

ab655cb

Remove duplicate test

3d04a7c

rename tests

b78e783

update rawMCP connection logic to not use sessionId, remove unnecessary wrapper class

78032aa

update all tests to include all types of inputRequests

b8621d4

removing duplicative tests

85ef0af

removing duplicative tests

9d86c97

removing duplicative tests

d446a35

Fix formatting

a8fa736

CaitieM20 assigned CaitieM20, felixweinberger and maxisbey Mar 17, 2026

CaitieM20 added the enhancement New feature or request label Mar 17, 2026

CaitieM20 mentioned this pull request Apr 7, 2026

SEP-2322: Multi Round-Trip Requests modelcontextprotocol/modelcontextprotocol#2322

Merged

9 tasks

CaitieM20 added 7 commits May 14, 2026 16:35

Merge branch 'main' into mrtr-tests

71d682a

Agent WIP

23226ed

fix formatting

1ae8855

remove session id and cleanup resultType Checks

7324cba

remove tasks

e203295

formatting

d22b308

Merge branch 'main' into mrtr-tests

5b5c02f

CaitieM20 added 9 commits May 20, 2026 12:02

switch to using RPC

76b8874

remove negative tests, returning a complete result is normal behavior.

d1ea13e

update client tests

bd1dc47

style checks

742a37c

fixing CI issues

7c7c21c

Merge pull request #1 from CaitieM20/mrtr-tests-5-14-update

e0c5fbf

Mrtr tests 5 14 update

revert client-helper.ts changes no longer needed

a53faf2

revert client-helper no longer needed with sessionless change

f56d110

revert client-helper.ts

e76270c

CaitieM20 commented May 20, 2026

Comment thread src/scenarios/server/lifecycle.test.ts Outdated

CaitieM20 added 2 commits May 20, 2026 14:21

Apply suggestion from @CaitieM20

774d53b

Mrtr audit (#2)

a041f5c

* add missing tests and update sep-2322.yaml

CaitieM20 requested a review from pcarleton May 21, 2026 02:46

CaitieM20 enabled auto-merge (squash) May 21, 2026 02:49

CaitieM20 disabled auto-merge May 21, 2026 04:09

CaitieM20 added 3 commits May 20, 2026 21:42

Mrtr tests updates (#3)

a5e971f

* remove duplicates in index, rename test cases to be consistent
* add negative tests
* refactor into everything-server and update negative tests
* fix conformance tests

fix bugs and align naming

756a125

don't hardcode port in negative test server

41140e3

CaitieM20 enabled auto-merge (squash) May 21, 2026 15:28

CaitieM20 requested a review from felixweinberger May 21, 2026 18:08

CaitieM20 and others added 4 commits May 21, 2026 16:42

Merge branch 'main' into mrtr-tests

f5a0d31

fix merge

8a6a671

pcarleton approved these changes May 22, 2026

CaitieM20 merged commit 43fbf60 into modelcontextprotocol:main May 22, 2026
4 checks passed

pcarleton mentioned this pull request May 22, 2026

chore: refresh SEP traceability manifest after SEP-2322 (#188) #301

Merged

CaitieM20 merged 40 commits into modelcontextprotocol:main from CaitieM20:mrtr-tests
